WO 2005/049790 



1/39 




B 




DIDLID 

(14 aa; 71%) 



c 



340a a 



AV- 



IV 



DNA-bindlng 
+~4 » 




S-Nrfl, 2 



CNC BR 

(43 aa; 23%) 



WWTRTCAT 



GCNNNGTGA 



SKN- 1 (monomer) 
ARE (dimer) 



Fig. 1A-1C 



WO 2005/049790 



BEST AVAILABLE COPY 

PCT/US2004/01 9046 

2/39 



skn-1(+) 





. "... ; 



skn-1(zu67) 




A MESH r . > j •• -J ••>'. «n> if j f i t f'- ■ ' . 



.-'!..» ' ' ; -J *|-- < I 1 1 ,hi . 



, |- I If Ml* !■ .."1 ■**+•?.-. I *■ *— ■ . " t .t -"f * 




Fig. 2A-2L 



WO 2005/049790 



BEST AVAILABLE COPY 



PCT/US2004/019046 



3/39 



A 



Pharynx-specific Intestine and AS1 

/ — K — v A -> 



gcs-l::gfp 
gcsLl::gfp 
gcsA2::gfp 
gcs£J::gfp 

gcsL4::gfp 
gcsA5::gfp 



gcsA2;nruil::gfp 
gcsA2;mut2;:gfp 
gcs&2pnut3::gfp 



-1840- 



-879 



-682 



-682 



1 



2 



3+1 



Intestine ASI Pharynx 



-336 4- H 



-98 - 



■H 



+ 



+ 



+ 



— i — 

-l 000 -800 -600 -400 -200 



+++ 



nd 
od 



nd 
nd 



B 



gcs-1::gfp 



gcsA2:;gfp 



gcs&2; 
mut3::gfp 







Nomarski 



GFP 



GFP + Dll 



^ med-l 
med-2 



-124 
-127 
-127 



CI 



CI2J3 



CI C T 3 TCATCATGA 1 3ATT1 ITG G -Jck Tl S T : M CA 



TCATCATG AhtaATTTt TT^teKG -ICfijTl 



-TTTC3-B3 
XTTC1-83 
-83 



OMULS*! 



Fig. 3A-3C 



BEST AVAILABLE COPY 

WO 2005/049790 PCT/US2004/019046 

4/39 



A 



SKN-I 



Competitor 
WT Mutant 



IT 



ir 



lysatr 



'lysattr 



SKN-I 
Bkgd 



«»•» — 




I 2 3 4 5,6 7 8 9 10. .11 12 1314 IS 16 17 18 19 20. 
Wild type Mutant WOdtype 



B 



SKN-1 




lysate 



Competitor a 
WT Mutant 




SKN-l 

Free 
DNA 




W 



wa» ASP 




t 1 2 3 4 3„ 6 7 8 9 10. , 11 12 13 14 IS 16 17 18 1920. 
Wild type Mutant Wild type 



Fig. 4A-4B 



WO 2005/049790 



BEST AVAILABLE COPY 



5/39 



PCTYUS2004/019046 



a. genomic 



Sphl 

b.SKN-1::GFP i — : 



Sphl 

c. SknPro::GFP < — 



EcoRI 
i 



mm — ohm 


1 ■ ,-''H 

• 









EcoRI 



Ikb 




..." T> M:^^>^^„: • 

!i.. t " i.i.i i 



^3 



f • . • " i 





*• i. . 
i is- 



"•if 1 -. 



I 









Fig. 5A-5G 



BEST AVAILABLE COPY 

WO 2005/049790 PCT/US2004/019046 

6/39 




0 5 10 . 15 20 25 30 



Days 



Fig. 6A-6B 



BEST AVAILABLE COPY 

WO 2005/049790 PCT/US2004/019046 

7/39 



Honed genes 



Ho dels 

M176.2 

glutathione synthase 



YRCs ft Cosnlds 

K176 

4 



The Ml 76.2 gene is located on chromosome TL Regulatory sequences can be found e.g., in the 
region between 9416340 and 9415915. An exemplary sequence of this region is as follows: 

* * 

\ * 

\ 

GACAATTATCGATTAATAAAAGTTTAAA&^CACGA 

♦ 

GTTGTTTTGTGTG TAGAAAAATAATTTTGATAGAAACAAAAAATTAGCGTAAAAT 

. 

TACTCGTGCACGAGATGTGCGCCAGCAGCTCCTTGAGGC^ 

-378 

: 

TGCTCTTTGAGTTCTTTGTTTTCGGGAGCAAATTTCATGCCAATCCCTTTCTTTTTTTCAAATTTTCCTG 

■ 

TTn&ATTf!ATG TAAT AACT AT T AT TC AT G T CAATTACAACAAAT AAGCATCCAAGAT TT TAT CATA AACT 

-243 

• • . 
CGTTCAAACCTCCTTTTACCACTCGAAAAGCAATATCTCCGACTTCCTTCAAAGAGAAAT 

-169. 

CATAGAAACCTCACGTTATACGT TTTGTCATG ACGATTTC^GTGCTCACTTTTCTCATTTCATTCTGGCT 

-137 

TAATTTCATTTTTGTCACTCTCGCGTCATGTTTTGCATTTTTCGAAAGCATTTATTTAAAACTGAAAAAA 
TAATTCGTAATTTTTCAAGAATGGCT 



« 
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Exemplary Ml 76.1 sequences: 





MAQKD DRILL 
ASLYHFIAYE 
MGAHALRLTE 
LGVPMTCIIR 
KTQQVLSEDG 
LPEEERGAFI 
VDTPFLYEFI 



LNAPRLPLED 
FDFLIDIHKN 
WHIRVLKALN 
RNLTQCYEQL 
VLERFIGKPR 
LMEKLKPMII 



DKLNELTADL 
WKTDDFTRN 
ISDDVIQRAI 
SLNDRSDLMI 
EARDIRASFA 
ENYLVLAKKP 



HDWAHANGLV 
MVEILKKVKA 
PENKPIPMIA 
DGRQVAIVYF 
GMWALENTDE 
ITFAKAVSEL 



MRLSTDKLSS 
QGLKQPVTLA 
EALFKAWSHF 
RAGYSPDHYP 
VTMKWAGAQ 
GVYGYAFGRK 



EVCQTTPLTL 
IQRSDYMCHK 
SNPAAWLW 
STKE WEARER 
KHPEAFVLKP 
DAPELKTAGH 



LPSPFPKNVF 
DQYSAEYGLK 
VENVNQNQID 
MELSTAIKTP 
QTEGGAALHT 
LLRTKPESTA 



EEAVHIQNLF 
QIEINNIASS 
QRHVEYELEK 
WIGLQVANTK 
GDEMVQMLRE 
MGGVAAGHAV 






aaagaATGGCT CAAAAAGATG ACCGGATTTT GCTGTTGAAT GCTCCAAGGC TCCCGCTCGA AGATGATAAG CTCAACGAGC 
TCACCGCTGA TCTTCACGAT TGGGCTCATG CTAATGGGCT TGTCATGCGT CTATCAACCG ACAAGTTGAG CAGGGAAGTT 
TGT CAAACTA CTCCATTAAC ACTTCTTCCA TCTCCATTCC CGAAAAATGT TTTTGAAGAA GCAGTTCATA TTCAGAACCT 
TTTCGCAAGT CTTTATCACT TCATAGCTTA TGAATTTGAT TTTCTAATCG ATATTCATAA AAATGTCGTG AAAACTGATG 
ATTTCACACG GAATATGGTT GAGATCTTGA AGAAAGTCAA AGCCCAAGGA CTCAAGCAAC CAGTCACTCT CGCGATTCAA 
CGATCTGATT ATATGTGTCA TAAGGATCAA TATTCAGCGG AATATGGACT GAAACAAATT GAAATAAACA ATATCGCCTC 
GTCAATGGGA GCACATGCTC. TACGGCTCAC CGAATGGCAT ATCAGAGTTC TTAAAGCGTT GAACATTTCC GATGACGTCA 
TTCAAAGAGC AATTCCAGAA AACAAGCCAA TTCCAATGAT CGCTGAAGCT TTATTCAAGG CCTGGTCX:CA CTTTTCGAAC 
CCAGCAGCTG TGGTTCTTGT CGTTGTAGAA AACGTCAATC AAAATCAGAT TGATCAACGC CACGTGGAAT ATGAACTTGA 
AAAGTTAGGA GTACCGATGA CATGTATTAT TAGAAGAAAT TTAACACAAT GCTATGAACA ATTATCATTG AATGATAGAA 
GCGATTTGAT GATTGATGGG CGTCAAGTAG CAATTGTTTA CTTCAGAGCA GGATACTCAC CTGATCATTA TCCATCTACA 
AAAGAATGGG AAGCACGTGA GCGTATGGAA CTTTCCACCG CTATCAAAAC TCCATCGATC GGGCTACAGG TGGCAAATAC 
TAAGAAGACC CAGCAGGTTC TTTCTGAAGA TGGAGTACTC GAAAGATTCA TCGGAAAACC ACGAGAAGCT CGCGATATTC 
GAGCTTCATT CGCAGGAATG TGGGCTTTGG AGAACACTGA TGAAGTGACT ATGAAAGTCG TGGCTGQAGC TCAAAAACAT 
CCAGAAGCGT TTGTTCTGAA GCCACAAACT GAAGGTGGAG CCGCATTGCA CACCGGTGAT GAGATGGTTC AAATGCTCCG 
AGAACTTCCG GAAGAAGAGC GTGGAGCTTT CATTTTGATG GAGAAACTGA AACCGATGAT TATTGAAAAC TACCTGGTTC 
TTGCAAAGAA GCCGATCACA TTTGCTAAGG CTGTTAGTGA ACTTGGAGTG TATGGTTATG CATTTGGAAG GAAGGATGCA 
CCTGAGCTTA AGACTGCTGG GCATTTGCTC CGAACGAAAC CGGAATCCAC AGCTATGGGT GGAGTAGCCG CCGGACATGC 
TGTTGTCGAC ACCCCATTCC TCTACGAATT TATTTGAttt cgaacataat cagaaaactc aacaaaaatg ctgtgatatg 
aaaccatttg ctatttagat ctttttgtgt ttgtaaattt aatcattgta atttattgaa tgt 
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The F39B2.3 gene is located on chromosome I. Regulatory sequences can be found e.i*. 9 in the 
region between 14 797 521 and 14 798 310. An exemplary sequence of this region is as follows: 

(^TTTTGAAAGTGCCCAAGTTGCTGGAACGCTGAAAATTGAAATTATT^ 
TCCGAAJU^TCAAGAAAAAATCGATAATTTCGTCGGACAATCCGCCTC 

TGCCGCGCGCTCATTCGATTTGTGTGAGTGCCCAGTGGAGCGCGTTTGCTAAGGCTAACTGTGTAGTCCT 

CTCGGACAAGATCTGTGAACATTGAAATGAAACACTTGG * 

-518 

■ 

TTTTGTTTGCGACCGAAAAAAAATTATAAAAATTGAATATT GGTTATCATC GTTTCAATCTTTGTTTTGT 

. -469 

* 

ATTAAAGGCACAGCTGCTAAAAATTGTTTTTTTTTTTTCAATTTTGCTAAAAGAAAATCAATTTTCTGAT 
TTTTTGTTGAGTTCCCGTGCAAATCAATGTCCTAGCTTTTTAAAATTGTTTTTTGTTATGTAATTCTAAT 
GAAATTTTGTCGAATTTTGAGAGATTTTCTGCTAAAAC^ " 
AAACATTTACTCAAACCTTTTACGGAAAAATGAAACAAAAGT^ 

♦ » 

CTGAACGCGTACTTAAAGGTACACGGTTTGATTCGGATTGGTCCCGCCACAAAGTGTTACCATAACATTT 
TTCTCGCTGCGAGACCC^TCCGAATAAATCCGTGCGCCTAATCAGTGCGAGTACGCATTTCATATTAGTG 

* 

ATAAGTGCCATTTTTAGAACAATG 
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Exemplary F39B2.3 sequences: 






MSKSICKSSM RAAWRRFGA PDVIEAVESD MPRLEKNQVL VRNYAAGVNP VDTYIRAGQY GKLPNLPYVP GKDGAGEVEL 

VGESVKNVKV GDRVWYGSEA DSTAEYVAVN RPFELPEGVS FEEGASLGVP YLTAYRALFH LAGAKTGDVI LVHGASGGVG 

SALMQLAAWR NIEAVGTAGS ADGIRFVKSL GARNVYKHSD KQYVSKMKND YPGGFNHIFE MAAHTNLNTD LGLLAPRGRV 

AVIGNRAETT INARQLMVTE GAVYGVALGM SSEAELLDPG INIVSFLKET EFRPLINKLY RLEQLGLAHE EIMNNKGAKG 
NLWQIEH 




ATGAGCAAAT 
.CGAGAGTGAT 
ATATTCGTGC 
GTGGGAGAAA 
TGCGGTGAAT 
CTTATCGTGC 
AGTGCACTGA 
GAAGAGTCTT 
GCT-TCAACCA 
GCAGTAATTG 
ATTGGGAATG 
CACTTATAAA 
AATCTTGTAG 



CGATTTGCAA 

ATGCCCAGGC 

TGGTCAGTAT 

GCGTTAAAAA 

CGACCATTCG 

ATTGTTTCAT 

TGCAGCTGGC 

GGTGCACGGA 

CATTTTCGAA 

GAAATCGCGC 

TCTTCCGAGG 
CAAATTGTAT 

TGCAAATCGA 



ATCAAGCATG 
TTGAAAAAAA 
GGAAAACTAC 
TGTGAAAGTC 
AGTTGCCGGA 
CTTGCTGGTG 
TGCCTGGAGG 
ATGTCTATAA 
ATGGCTGCTC 
CGAGACCACG 
CTGAGCTCTT 
CGTCTCGAGC 
ACATTAAttC 



CGCGCAGCTG 
CCAGGTTCTC 
CAAATCTTCC 
GGCGATCGAG 
AGGAGTTTCG 
CAAAGACTGG 
AACATTGAAG 
TCATTCGGAT 
ACACAAATCT 
ATCAACGCAA 
GGACTTTGGC 
AATTAGGACT 
attattttaa 



TAGTCCGACG 
GTTCGGAATT 
ATATGTACCA 
TCTGGTATGG 
TTTGAGGAAG 
CGACGTTATA 
CTGTTGGCAC 
AAGCAATATG 
GAACACGGAC 
GACAACTTAT 
ATCAACATTG 
GGCTCATGAG 
cacgcca"ttt 



ATTCGGAGCA 
ACGCTGCCGG 
GGAAAAGATG 
ATCAGAAGCG 
GAGCTTCTCT 
CTTGTACACG 
TGCTGGATCT 
TGTCGAAAAT 
CTCGGATTGC 
GGTTACAGAA 
TCTCATTCTT 
GAAATTATGA 



CCTGATGTCA TAGAAGCCGT 
TGTCAATCCA X3TTGACACAT 



GACAGTACAG 
CGGAGTGCCT 
GAGCATCTGG 
GCTGATGGGA 
GAAAAATGAT 
TGGCTCCACG 
GG AGCTGTTT 
GAAGGAAACC 
ACAACAAGGG 



CAGAGTATGT 
TATCTTACCG 
TGGAGTGGGA 
TGCGGTTCGT 
TATCCAGGAG 
TGGTAGAGTT 
AGGGTGTAGC 
GAGTTTCGTC 
AGCGAAAGGA 
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The R03D7.6 gene is located on chromosome II. Regulatory sequences can be found e^g., in the 
region between 10949088 and 10950317. An exemplary sequence of this region is as follows : 



AGAACTTTTCGAGAAGTCTACCGTTGTAGTTTTCGAAATAGTAATTTATTTAGTGACGTTTATAAAGQTTTACATGATTT- 
GGTTTGGAAATT TTTTAGGAGT T TATTCATAAAAACAAAGTAACCATGGACATTCCAGAAGTCTAT AGT ACACGCGAT^ 
TACCGTACCCTTCAGTATTTCTATCAGATTGATAGCTTTCGGTAGTCAGGTACAGCCTAAAAAATTCCTGCTTGCCTTTT 
TGCCTACATGTCTGCCTACCTTCAGTCATAATGCCTACATAATG 

-947 . 

ATTTTTTCCAATTGAAACTTGGAGACAGAAATTGAAATG 

» m 

CATTTCTTTTCATATCAGTTTTCCTGTCAAAGCACATTTCTGGAGTCTGTGTGTATTTTTTTGTGTCTTTATGTGATCGG 
TGTTGTGAAATTTGTAGTTGATGTTGATAACATACTTCT 

TATGTTCGTGATACTTTGCGATTCTCC^GCCAAAAACACGGTATCCGGTCTCGACACGACAACTT 
rGCTGATGTGCGCCTTGAAAGAGTACTGTAATTTCAACCTTTCGTTC 

AT G TAT T T AT T AAACAAAAAAC T AAAAC AAAACAAT T G AGAAC ACAT AAAT TGT G AAAAATCAATG AG ACCACAGC AAAA 
AATTTT GT ATCTAG AGT ACTCT T T AAAGGCGCACATCCGTTCT TATTTTCAGCAAAAATGTCGCT TCGAGACCGGGTACC 
GTATTTTTTTTTGTGCAAAACTTTAGGTCTAGGTAATATTAAAAAAAAATTCCACAAAAC^ 

TAAATTTTTT GATOA^TO^ 

-302 -282 

ACTGAATTCTCTTTCCGAAAGACCACCACAATTTCAGGGCTCCGCCCATTTCGTGGTTTC 
T T T T GAT GACAAT TGT G AG AG AAG TGAGAGG T T CAG ACAC AAAAAGCGACGTGGTCGAATGA 

-149 

■ • 

GTATAAATAGAGAGTGAAGTTTCCAATTTCCCTC^CAATTGTTTGTTTGCAATC 
TCAAAAATCATTATGGTT 



FIG. 11 



WO 2005/049790 



BEST AVAILABLE COPY 



PCT/US2004/019046 



12/39 



Exemplary R03D7.6 (gst-5) sequences: 




MVSYKLTYFN GRGAGEVSRQ IFAYAGQQYE DNRVTQEQWP ALKETCAAPF GQLPFLEVDG KKLAQSHAIA RFLAREFKLN 
GKTAWEEAQV NSLADQYKDY SSEARPYFYA VMGFGPGDVE TLKKDIFLPA FEKFYGFLVN FLKASGSGFL VGDSLTWIDL 
AIAQHSADLI AKGGDFSKFP ELKAHAEKIQ AIPQIKKWIE TRPVTPF 




ATGGTTTCCT 
ACAATACGAG 
CATTCCTCGA 
<3GAAAAACCG 
TTTCTACGCT 
TCTACGGATT 
GCTATTGCCC 
GAAGATCCAG 



ACAAGTTGAC 
GATAATAGAG 
AGTCGACGGT 
CCTGGGAAGA 
GTCATGGGAT 
CTTGGTCAAC 
AACATTCAGC 
GCGATTCCAC 
tatttttttt 



CTACTTCAAT 
TCACTCAGGA 
AAGAAGCTTG 
GGCTCAAGTG 
TCGGTCCAGG 
TTCTTGAAGG 
TGATTTGATT 
AAATCAAGAA 
tttt 



lice 

GGACGTGGCG 
ACAATGGCCA 
CTCAATCCCA 
AACTCTCTTG 
AGACGTTGAA 
CTTCGGGATC 
GCCAAGGGAG 
ATGGATCGAG 



CTGGAGAAGT 
GCATTGAAAG 
CGCGATTGCT 
CCGATCAATA 
ACTTTGAAGA 
CGGATTCCTT 
GTGATTTCAG 
ACCCGTCCAG 



GTCTCGTCAG 
AAACCTGCGC 
GGTTTCTTGG 
CAAGGATTAT 
AAGACATCTT 
GTCGGAGACT 
CAAGTTCCCA 
TCACACCATT 



ATTTTCGCCT 
TGCTCCATTC 
CTCGTGAGTT 
TCAAGTGAGG 
CCTTCCAGCA 
CTTTGACCTG 



ATGCCGGACA 
GGACAACTTC 
CAAGCTCAAC 
CTCGTCCATA 
TTTGAAAAGT 
GATTGACTTG 
CTCATGCCGA 
CTAAatagct gtataaaatc 
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The F35E8.8 gene is located on chromosome V. Regulatory sequences can be found e.g., in the 
region between 15 917 841 and 15 91 8 925, An exemplary sequence from in or around this region 
is as follows: 



TC TCATTCT CTT CAAG ACATAACACAACGGGCTGACGACCATATCATCAACGACGATTTTTT AGGAACTG 
TACTTTATCTGTGTCTGACCAACACGTGTGAATGAAGTTTCAACTGGAAAATTTGTTTGAAACACTGCAA. 
AGAATTTCGAATTTT GATGATAATT TTAAATGCCATTATCAGTTTTAATACGCCACTCTAGTCTTTGATT 

-240 

CTTTGCACACACACACA<^CACACACACAC^^ 

CTGATTTAACGAGAAAACATTTG ATGACAAT A AACTTGGCGT AT TAATATAAAAGGGAAAATTCAAT TCA 

-94 

* 

GATTCTCAACGGTTTATTTTCTGTCACAACTCTTCCTAATATTCACCATGGTTT ' 
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Exemplary F35E8.8 (gst-38) sequences: 




MVSYKLTYFD GRGAGELCRQ IFAAAEQKYE DNRLTDEEWE KFKAAGKTPY NQLPMLEVDG KPIAQSHAMA RYIAREFGEU 
GKSRWEEAQV NSLADQYKDY YAEARPYLAV KLGYTEGDAE ALYTSVYLPV FKKHYGFFVN ALKASGSGFL VGNSLTFIDL 
LVAQHSADLL GREKSDLFND VPEMKAHSEK VQSIPQIKBCW IETRPASDW 




ATGGTTTCCT 
GAAATATGAA 
CAATGCTCGA 
GGAAAGAGCA 
CCTCGCTGTG 
ACTATGGATT 
CTTGTTGCTC 
TTCCGAAAAA 



ACAAGCTTAC 
GATAACAGAC 
GGTAGATGGC 
GATGGGAAGA 
AAGCTTGGTT 
CTTTGTCAAT 
AGCATTCAGC 
GTTCAGTCAA 



CTACTTCGAT 
TTACCGATGA 
AAACCACTCG 
AGCTCAAGTC 
ACACAGAAGG 
GCTTTGAAGG 
TGATTTGCTG 
TTCCTCAGAT 



GGACGCGGAG 
GGAGTGGGAG 
CTCAGTCCCA 
AACTCCTTGG 
AGACGCGGAG 
CCAGCGGGTC 
GGACGTGAAA 
CAAGAAATGG 



CCGGAGAGCT 
AAGTTCAAAG 
CGCGATGGCT 
CCGACCAGTA 
GCTCTTTACA 
AGGATTCTTG 
AGTCGGATCT 
ATTGAGACTC 



CTGCCGTCAA 

CGGCCGGAAA 

CGTTATCTTG 

CAAAGACTAT 

CAAGCGTCTA 

GTTGGAAATT 

TTTCAATGAT. 

GTCCAGCGAG 



ATCTTTGCTG 
AACCCCATAC 
CTCGGGAATT 
TACGCGGAGG 
TCTTCCAGTT 
CCTTGACTTT 



AACCAGCTTC 
•CGGGTTGAAC 
CTCGTCCATA 
TTCAAGAAAC 
TATTGATTTG 
TGAAGGCACA 



TGACTGGTAA 
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The Fl 1G1 1 .2 gene is located on chromosome I. Regulatory sequences can be found e.g., in the 
region between 4 880 968 and 4 882 068. An exemplary sequence in or around this region is as 
follows: 

• - * * 

■ - • . ■ 

AATTGAAATGAGTTTGCJUITTTTGTAT^ 
AAATCGATGTTTCTAAAATATCTTGAATGAM 

AGAAGCTTTTTGCATTGCAAGAGTGTTGAAATATACAGGATATTTAC^^ 
AAATCAGACCTACACCGCCTTCCTTTGTGGTTTAC 

TTCAAAGCACAAAACC7UVTCACGTCATAATGTTAATTTGACTTTTATTGTCAAAAATT 
<^TGAACATAATAAGAATTTTCAAATTTCGGTGGGCA r ^^ 
TTCAAAGGCAGGTGTATAACCCTCAAAAGAAAGCACGTTTGTGTTTC^^ 
CCATAGGAAATCATTGTTCATCAGGC^CCTTCC(^GAAATTAGGCTGTAGGCAGGCAC 
GCCTCTTTTGCGCGAGATTATGAAATTGTGTTGTACTGTCGGAAAAAT^ 
.TTTTGTGTCAGTTATAGTAGTTT CTTATCATG GTATCTTCAATAATAATGGGA7VGCGTAAC 

-391 

AAGATGATTGATGCCATGGGTTTATATTTGTGAGTAGTCA^ 
GTC^C^AAACCTTGCATATATTTTTTTCAACC^^T^^ 

AACGATGACTAC^CTtTATGATTTCTAGTCAACAACGTGCGCGC^TGTGTAGAGCAAAT GATGACAAACT 
GGTGAGTGGAGAGACGACAGACATTTGAGAAATGGGTATAAATA 
-133 

■m 

gagAcggccggcattcagtgttcaacccttctcatcgaccactcgatttcttgcttggttatttcaacaatg 
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Exemplary Fl 1 Gl 1 .2 (gst-7) sequences: 




MVHYKVSYFP IRGAGEIARQ ILAYAGQDFE- DNRIPKEEWP AVKPSTPFGQ LPLLEVDGKV LAQSHAIARY LARQFGINGK 
CAWEEAQVNS VADQFKDYLN EVRPYFMVKM GFAEGDLDAL AKDVFLPGFK KHYGFFANFL " KSAGSGYLVG * DSLTFVDLLV 
AQHTADLLAA NAALLDEFPQ FKAHQEKVHS NANIKKWLET RPVTPF 



cgaccactcg 
AGATTGCTCG 
AAGCCAAGCA 
TTACTTGGCT 
AGGATTACCT 
GACGTCTTCC 
TGGAGACTCT 
ATGAATTCCC 
GTTACTCCAT 




atttcttgct tggttatttc 
TCAGATCTTG GCCTACGCTG 
CTCCATTCGG ACAGCTTCCA 
CGTCAGTTCG GAATCAATGG 
CAACGAAGTT CGTCCATACT 
TTCCAGGATT CAAGAAGCAC 
TTGACCTTTG TCGACTTGCT 
ACAATTCAAG GCTCATCAGG 
TCTAAatgat ttcca 



aacaATGGTC 
GACAAGACTT 
CTCCTTGAAG 
AAAGTGTGCA 
TCATGGTGAA 
TATGGATTCT 
CGTCGCTCAG 
AAAAGGTTCA 



CACTACAAGG 
CGAGGACAAC 
TTGACGGAAA 
TGGGAGGAGG 
GATGGGATTT 
TTGCTAACTT 
CACACTGCTG 
CTCGAATGCC 



TATCGTACTT CCCAATTOGT 
AGAATCCCAA AGGAGGAATG 
GGTTCTTGCC CAATCTCATG 
CTCAAGTCAA CTCGGTTGCT 
GCTGAAGGAG ATCTCGATGC 
CCTCAAGTCG GCTGGATCOG GATACTTGGT 
ATCTTCTGGC TGCCAACGCA 
AACATCAAGA AGTGGTTGGA 



GGAGCTGGAG 
GCCAGCTGTC 
CTATCGCCCG 
GATCAATTCA 
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The K08F4.7 gene is located on chromosome IV. Regulatory sequences can be found e.g., in the 
region between about 10141800 and 101 422 17. An exemplary sequence of this region is as 
follows: 

* 

ATTATCCAAAAAGATTAGAAGTTGGCAAACCTTGGGCAAGAAT^ 
©CAACTTTATCCAATCTTTTACT 
GACTTGGATATGACTTATAGTTAGCCTAACCT 
TAACCJVGCTCCAATAGTTGTTATl'l^l'GCT ^TTGTCAT CATT 1 !"! 1 

-157 

» - — - • » ■ 

TCGACC^TTTAGACTCTCAAGTGAAACCAACTGT 

-83 

AAATATATATAAACTGGAATTAAAAAC^^TTGATACATCGATTCAATTACTGAACT 
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Exemplary K08F4.7 (gst-4) sequences: 




quelle 



MPNYKLLYFD ARALAEPIRI MFAMLNVPYE DYRVSVEEWS KLKPTTPFGQ LPILQVDGEQ FGQSMSITRY LARKFGLAGK 
TAEEEAYADS IVDQYRDFIF FFRQFTSSVF YGSDADHINK VRFEWEPAR DDFLAIINKF LAKSKSGFLV GDSLTWADIV 
IADNLTSLLK NGFLDFNKEK KLEEFYNKIH SIPEIKNYVA TRKDSIV 




ATGCCAAACT 
GCCTTACGAG 
TACAAGTCGA 
ACTGCAGAGG 
TTCCGTTTTC 
TGGCAATAAT 
ATTGCTGACA 
CAAGATTCAT 
tctgaattat 



ATAAGCTATT 
GATTATAGAG 
TGGAGAACAA 
AAGAAGCTTA 
TATGGAAGTG 
CAATAAGTTC 
ATTTGACAAG 
TCAATTCCAG 
gtatgtagta 



GTATTTTGAT 
TTTCAGTGGA 
TTCGGTCAGT 
CGCTGATTCA 
ACGCTGATCA 
CTGGCCAAGA 
TCTCCTGAAG 
AAATTAAGAA 
aaataatatc 



GCTCGTGCTC 
AGAATGGTCA 
CAATGTCTAT 
ATTGTAGAXC 
TATTAACAAA 
GTAAATCAGG 
AATGGATTCT 
TTACGTGGCA 
gttcctatca 



TTGCTGAGCC 
AAGCTGAAGC 
CACAAGATAC 
AATACAGAGA 
GTACGTTTTG 
ATTCCTCGTT 
TAGATTTCAA 
ACAAGAAAGG 
cgtctcccag 



AATCCGTATC 

CAACGACTCC 

TTGGCAAGAA 

TTTCATATTC 

AAGTTGTTGA 

GGAGACTCAT 

CAAAGAAAAG. 

ATAGTATTGT 

agagcgtaat 



ATGTTTGCAA TGCTCAATGT 

CTTCCCATTT 
AATTTGGACT CGCTGGAAAA 

AATTCACTTC 
GATGATTTCT 
TGATATTGTG 
AGTTCTATAA 
attatttaag 
tatgtg 



TGACTTGGGC 
AAGTTGGAAG 
TTAAaatcga 
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The sod-1 gene is located on chromosome II- Regulatory sequences can be found e.g., in the region 

between about 6 973 806 and about 6 974 406. An exemplary sequence of this region is as follows: 

■ 

a ■ * * 

ATTCCGCAACCCCGTCAAATTTAAG^ 
• TTTTTTTTGTTGCCTTCGGCGTTTTC^ 
TCTTC^TATCTTGTTTCTGGA^CGAAAATTTC 
CACAAAAATTTCACATTTAAATAACC^ 

GTTTGATGCGTGCACACTATTTTGCTGTTGTTTTTTTCTTTTTTCTCTAAATTTTCTTTACGCTTTCGTA 
GTTTCTATAGAAACGATTCTCCACTCCCGGTTTTCTTCCGATTCTCAAAATTAATTAAAATTTAGTTATT 
AAAAATCCTTTTTCTTGAAATAATCGTTCAATTTCGAGTTTTC^AGAGTGGAGAGGTTGAATTTGTGAGC 
CGCTTATTTTTTCTGTGTTTTTGTTTTGTGGTTTTTAATCAGTGT CATAATCATA CTTTCCATTGTTTCT 

-64 

TTATTATTCAAAGTTGTAGATTCAGTATTTTAGATCGGTGATG 
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Exemplary sod- 1 sequences: 




MFMNLLTQVS NAIFPQVEAA QKMSNRAVAV LRGETVTGTI WITQKSENDQ AVIEGEIKGL TPGLHGFHVH QYGDSTNGCI 
SAGPHFNPFG KTHGGPKSEI RHVGDLGNVE agadgvakik LTDTLVTLYG PNTWGRSMV vhagqddlge GVGDKAEESK 
KTGNAGARAA CGVIALAAPQ 




tttagatcgg 
GTCGAACCGT 
AGGCAGTTAT 
AACGGATGCA 
AGGCGATCTA 
GTCCAAACAC 
GAAGAGTCCA 
tgaatcgcgt 
atcccgttgt 



tgATGTTTAT 
GCTGTCGCTG 
TGAAGGAGAA 
TTTCTGCCGG 
GGAAATGTGG 
TGTCGTTGGC 
AGAAGACTGG 
ctctgaatct 
tcctactcct 



GAATCTTCTC 
TTCTTCGTGG 
ATCAAGGGAC 
TCCACACTTC 
AAGCTGGAGC 
CGATCTATGG 
AAACGCCGGA 
ccacacaatt 
actactgtat 



ACTCAGGTCT 
AGAAACTGTT 
TTACTCCCGG 
AATCCATTTG 
CGATGGAGTG 
TTGTTCATGC 
GCTCGTGCTG 
cctactaaag 
attttcacat 



CCAACGCGAT 
ACCGGTACTA 
TCTTCATGGA 
GAAAGACTCA 
GCAAAAATCA 
CGGACAAGAC 
CCTGCGGTGT 
acaatttttc 
aaaatttctt 



TTTTCCGCAG 
TCTGGATCAC 
TTCCACGTTC 
TGGTGGACCA 
AGCTCACCGA 
GACCTCGGCG 
CATTGCTCTC 
atttcttgct 
caaaatttca 



■GTCGAAGCCG 
ACAGAAGTCC 
ACCAATATGG 
AAATCCGAGA 
CAGGCTCGTC 
AGGGAGTCGG 
GCTGCTCCCC 
ttgtcgttat 
aataaaggtt 



CTCAAAAAAT 
GAAAATGACC 
TGATTCCACC 
TCCGTCACGT 
ACGCTTTACG 
AGACAAGGCA 
AGTGActacc 
attcttaaga 
gtagtttc 
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The sod-2 gene is located oh chromosome L Regulatory sequences can be found e.g., in the region 
between about 8 441 038 and 8 442 612. An exemplary sequence of this region is as follows: 

TGAATAAAAACGTTGAACCCAACGGACATCAAAGTATCAAAGTAAGTAAGTAA 

TATAAAAAATCTACTCGAAAATTAAGTGAGAATTGAAGGATTGCTTTCCGAAGAGAAAATG^ 
AAAC^TCAAAAATGTATATTAG 

AACGAGCTCGCTGGAGTTGGCATC^GTGTGGAAGGCAGACACATAAGAA 

ATGTTCCACACCGGGAGAAGTTCAAGAACGACTGGAAATTTTGGACCG 

AGTCi^VAAGACTGTTCTTCTGAAGAACAAGTTTTGCCGGAGCOAAGACGTCCTTTTCJ^ 

CCTGGTTGCCGCTATCTGGGTCGCTGGATCGACATTTCTGGCTCAATTGACGAAGAGATCTCGAGGAGAATAAGAGCAGG 
TTGGGGTGCTCTGGTTGGAATCAAAGAAGTCTTG AGAATCATG CCAAACAAGGA AA 

-980 ~ ' ~ -959 

■ 

» 

GTGCTACCAGCTCTCCTGTATGCTAGTGAAACTTGGACTTGTAATGCTGGATCGACG 
CGGTCTCATCGACGCTGCAGAAATTCGAGGCTGGAACTT 

CAGGACACATTCTACGGAGAGATCCAAACCGATGGACAAAAATCTGCACGGAATGGGAC^ 
AAACGTGCCGTTGGAGGACAGAAGAAGAGATGGGCTAAGGAGATCGACGAAGAATA 
CATGTCGGGACAAGTCGTTGTTGGGAGAAGAAGACTAGGAATGCTCACTCCGAAGGCTCCAT^ 
CCGACCGTGAAAAATGGAAAGAGTTTGTCCGCAGTTGCCTCGCAACTTGAACCC^GGGACATCAAftGTATCA^ 

TAAGTAAGTAAGTAACCTGAATAAAAACGTTGCAATTAAAAAATCTACTCGAAAATTAAGTGAGAATTGAAGGATTGCTT 
TCCGAAGAGAAAATGACAATTAT AGGG TATAC T AAAACATCAAAAATGT ATAT TAGACTACCATAAAT ATT ACGATAAT 
-363 . ^ 

• ♦ 

TTAAAAATTACTAGAAACACGCAATTCGGCTCAAAAAGCAAC7VATTTAGACTGAAAACGAGCTAAAAG^ 
AAACCACTTTGCTCGGTAAATCTG GTGTATCATG TTCCGCAAACACTGTCTTTTGTTTTGCG 

-191 

TACTTTGTTTACGCGCATTCGAATTTCAGTGTTCGCGCTTTTTGTTTACTTTTTTATTTTTCATCCAAAAAT-GGTATTTT 
CAGCTTGATATGTTTCTGCGAATTGTAAAAATTTATATTTGACTATTGAATATTTTAATTATTTGCAGCCGAAAATG 
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Exemplary sod-2 sequences: 




MLQNTVRCVS KLVQPITGVA AVRSKHSLPD LPYDYADLEP VISHEIMQLH HQKHHATYVN NLNQIEEKLH EAVSKGNVKE 
AIALQPAIJCF NGGGHINHSI FWTNLAKDGG EPSAELLTAI KSDFGSLDNL QKQLSASTVA VQGSGWGWLG YCPKGKILKV 
ATCANQDPLE ATTGLVPLFG IDVWEHAYYL QYKNVRPDYV NAIWKIANWK NVSERFAKAQ Q 




tttgcagccg 
GCTCGAAGCA 
CATCATCAAA 
AAACGTCAAA 
CTAATTTGGC 
CTTCAAAAAC 
GATCTTGAAG 
TCTGGGAGCA 
AAGAACGTCA 
gaagttgact 
aaagttttca 



aaaATGCTTC 
CTCGCTGCCA 
AGCATCATGC 
GAAGCTATCG 
AAAGGACGGA 
AGCTTTCGGC 
GTTGCCACAT 
CGCTTACTAC 
GCGAGCGTTT 
cccattgttt 
ccg 



AAAACACCGT 
GATTTACCAT 
CACTTATGTG 
CTCTTCAGCC 
GGAGAACCAT 
ATCAACTGTC 
GTGCCAATCA 
TTGCAGTACA 
TGCAAAGGCA 
tgtaactatt 



TCGCTGTGTC 
ACGACTATGC 
AACAATCTCA 
AGCTCTCAAG 
CGGCGGAGTT 
GCTGTTCAAG 
GGATCCACTT 
AGAATGTTCG 
CAGCAATAAa 
tttgtttctt 



TCAAAGCTTG 
TGATTTGGAG 
ACCAAATTGA 
TTCAATGGAG 
GCTCACCGCA 
GATCAGGATG 
GAGGCAACAA 
ACCAGATTAT 
tgagctgaat 
aattatttcg 



TTCAACCGAT 
CCTGTAATCA 
GGAAAAGCTT 
GAGGACATAT 
ATTAAGAGCG 
GGGATGGTTG 
CTGGACTTGT 
GTCAATGCTA 
cacaagaatt 
aaatgtaaat 



CACAGGAGTC 
GTCACGAGAT 



TATGCAACTT 
TCTGCAAAGG 
ATCTTCTGGA 



ACTTCGGATC 
GGATACTGTC 
TCCACTGTTC 
TTTGGAAGAT 
aatcgtcaaa 
tttcaaacct 



GGAATTGACG 
CGCCAACTGG 
tgtagcagta 
tttcaaatga 
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The ctl-1 gene is located on chromosome DL Regulatory sequences can be found e.g., in the region 
near 14 306 135. An exemplary sequence of this region is as follows: 

AAAAAAAAATCGATAAAA^TCCGCGTCAA^ 
GTACCGTAGTTTTTGGTGAAAACATTGCAAAATTTGGTCJ^ 

nrTTTATTTTATTTTTGGGTTTCCCTTATC GCTTATCATA AACATGTGAC GTCATCATC TCTTGTACAGA 

-997 -978 

• ■ 

GCACCGCGACTGGGAGTATAAGAATCGCCGGAAAACATCAATAATCAGTTCGGTAGAAGTGAA^ 

CGT AAAATATGATCATT TT TCGATGCACC^TATT TGACGCGCAATACTTCTA<^ TGTACTGC 

-880 

TCGTGGACAACTT T GGATTATTTTTTGTTT TT AAAATTCAAAATAGTCAATATATTGCTTAT T T ATAGCG 
CGCCTTTTTGACAGTAAGTTTGTCAAATTTGCGCGTAAGTTATGGTGTTTCCACATATGCACCATACAGG 
AACACCCCGCGGCCCGGCTAGTGGTACATCCATGCAAATGCGCTCTACTGATAATTTGAGTTTAACCAGG 
TTTAGGCGCAAGATAAGAAAAAAGCTTTGGACCAAAAAATTTAGAGTTTATTTTTTTCGGACATTTTTTA 
TATACATCACAAAAATATTGGGCCACTCGTTTTTGATAAAAACGAG^ 

TAGACAAATTGCGTACAGGTACCACTTTTCC^CGTAGTGCCAGGTTGTCCCATTACGCTTTGATC 
AAAATGCGGGAATTTTTCGTCCAGAAAAATGTGACGTCAGCACGTTCTCAACCATG 

AACTCTGCGTCTATTCTCCCGCATTTTTTGTAGATCTGTAGATTTGTAGATCAATCCATTGCCCGTATAC 
CCTGACCCATAATCAATACCTACCTAATTTTTGTCTTTCCCCCTACTTTTTTGCCTGTCCAAAATAAGCG. 
AGACTATGCCGTAGTCTGGTGTCCAACAAGATGTTCCTTATCAGTGATAACGCTACAATCTTCTTTCTTT 

TTTCTCTGTTTCTCTTGTCTCTCCCAACCCA 
AGAGTTTTATTTAATTCTAAATTTCCTAACTAAAATTTCAGA 
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Exemplary ctl-1 sequences: 



P 





MPNDPSDNQL 

FEVTHDITKY 

IHALKRNPQT. 

LDPTDAGKLA 

YFAEVEQAAF 

PNSFRGYRTR 

HPDFGNALRH 



KTYKETYPKP 
CKADMFNKVG 
HMRDFNALFD 
SSDPDYAIRD 
CPAHIVPGIE 
DDVKESTFQT 
QLCQKKH 



QVITTSNGAP 
KQTPLLVRFS 
FWMNRPESIH 
LFNAIESRNF 
FSPDKMLQGR 
TGDVDRYETG 



IYSKTAVLTA GRRGPMLMQD WYMDEMAHF 
TVAGESGSAD TVRDPRGFSL KFYTEEGNWD 
QVMFLYSDRG IPDGFRFMNG YGAHTFKMVN 
PEWKMFIQVM TFEQAEKWEF NPFDVTKVWP 
IFSYTDTHYH RLGPNYIQLP VNCPYRSRAH 
DDHNYEQPRQ FWEKVLKEEE RDRLVGNLAS 



DRERIPERW 
TjVGNNTPI FF 
KEGNPIYCKF 
HGDYPLIEVG 
TTQRDGAMAY 
DLGGCLEEIQ 



HAKGAGAHGY 
IRDAIHFPNF 
HFKPAQGSKN 
KMVLNRNVKN 
ESQGDAPNYF 
NGMVKEFTKV 




aaaATGCCAA ACGATCCATC GGATAATCAA 



CTGAAAACCT ACAAGGAGAC GTATCCAAAA CCCCAAGTGA TCACAACTTC AAATGGAGCT CCGATCTACT CGAAGACCGC 
CGTGCTCACC GCCGGGCGGC GTGGCCCAAT GCTCATGCAA GATGTAGTTT ATATGGATGA GATGGCTCAT TTCGATCGTG 
AACGTATCCC CGAGCGTGTC GTTCATGCCA AGGGAGCCGG AGCCCATGGA TACTTCGAGG TCACCCATGA CATCACCAAG 
TACTGTAAGG CCGATATGTT CAACAAGGTC GGAAAACAGA CACCACTTCT CGTTCGTTTT TCAAOGGTCG CTGGAGAATC 
GGGATCCGCT GATACTGTCC GCGATCCACG TGGATTCTCT CTCAAATTCT ATACCGAGGA GGGTAACTGG GATCTGGTTG 
GAAATAACAC TCCGATCTTC TTCATTCGTG ACGCAATCGA CTTTCCGAAT TTCATTCATG CCCTGAAGCG CAATCCACAG 
ACTCACATGA GGGATCCGAA TGCGCTCTTC GATTTCTGGA TGAATCGCCC TGAATCCATT CATCAGGTGA TGTTCCTCTA 
CTCGGATCGT GGAATTCCTG ATGGATTCCG TTTTATGAAT GGATACGGAG CGCATACTTT CAAGATGGTC AACAAGGAGG 
GAAATCCGAT TTATTGTAAA TTCCATTTCA AGCCTGCTCA AGGTTCCAAG AATCTCGATC CAACTGACGC TGGAAAGCTC 
GCCTCTTCGG ATCCAGACTA TGCGATCCGC GACCTGTTCA ATGCCATTGA GTCAAGAAAT TTCCCGGAAT GGAAGATGTT 
CATTCAAGTG ATGACATTCG AACAAGCTGA GAAATGGGAG TTCAATCCAT TTGATGTCAC TAAAGTTTGG CCACACGGTG * 
ATTACCCACT GATCGAGGTC GGCAAGATGG TGCTGAACAG GAATGTGAAG AATTATTTCG CTGAGGTCGA ACAAGCCGCC 
TTCTGCCCGG CCCACATCGT CCCAGGAATC GAGTTCTCGC CAGACAAGAT GCTCCAAGGG CGTATCTTCT CCTACACGGA 
CACGCATTAC. CATCGCCTTG GACCAAACTA CATTCAGCTT CCAGTCAACT 6CCCGTACCG CTCCCGTGCT CATACCACTC 
AACGCGATGG TGCAATGGCT TATGAAAGCC AGGGAGATGC GCCGT^ATTAC TTCCCGAACA GTTTCCGCGG ATACCGTACT 
CGTGATGATG TGAAGGAGTC GACATTTCAG ACGACTGGAG ATGTTGATCG TTATGAGACT GGAGACGATC ACAACTACGA 
GCAGCCACGT CAGTTCTGGG AGAAAGTGCT CAAGGAGGAG GAGAGAGATC GGCTCGTTGG GAATTTGGCT AGTGATTTGG 
GTGGCTGTTT GGAGGAAATT CAAAATGGAA TGGTCAAAGA GTTCACGAAA GTTCATCCGG ATTTCGGAAA TGCTCTTCGC 
CATCAGCTCT GCCAGAAGAA GCATTAAatt 
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The sod-3 gene is located on chromosome X. Exemplary regulatory sequences include: 

TATTCG CAGAAAAAAGTCGTTGCAAACATTCG TTTTTATATGTTTTTCTTTGAG 

TTGAAAGTGAAAAATATTTGCTTAAAACTTCCyiAA 

ATTTTCAAAAACATTCAATCTTTTCCCTTGGAGT^ 

TGGATGCCCTGATAAAAAGTAGGTGAAATTTCGCAGGGGAACATCATATTAAAATC 

GAAATGGAAATGTTTGTCGGTGGTATGCTCGAATA1TTC 

TTTTTGACAAACGGAAAAAATTTGTGTCGAAATACTACATTTTCGATAAC^ 

CTTATAAAAACTGTTTGACTATCTTATTTCAGGAAAAAAAAATCCAAG 

AACTTTCTAATGGCTGATTAATAAAACAAAGTTATA 

TCTTGTGTTTTTTTTTGAATATTTC^^ 

ACAATTTTTCAAACCGAAAATTGACCTTTGACTTTGTTTACTTTG1T 

OTTCTATTGCTGTTGATGAGGTCTTTGATCAAATTTGTATO 

AAATCATCTAATATATTGTCAAACAACTTCTTGTTTTTTTTTT 

^^^^^^^^^^^^^^ 

-287 

CTTAAGAAAGGTTCACACAACAACTCTCCTCTCGA^ 

TGTTTGCCAGTGCGGGTTGTTTACGCGTTTTCAAGATTTTTGGTCTCCTAT 

TTTTTCCTTTCATTTGGTTTTTTTCTGTTCGAGAAAAGTGACC 

AATAAAATGCTG 
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Exemplary sod-3 sequences: 




MLQSTARTAS KLVQPVAGVL AVRSKHTLPD LPFDYADLEP VISHEIMQLH HQKHHATYVN NLNQIEEKLH EAVSKGNLKE 
AIALQPALKF NGGGHINHSI FWTNIAKDGG EPSKELMDTI KRDFGSLDNI* QKRLSDITIA VQGSGWGW1/3 YCKKDKILKI 
ATCANQDPLE GMVPLFGIDV WEHAYYLQYK NVRPDYVHAI WKIANWKNIS ERFANARQ 




cgtttgtcaa 
TTGCGGGAGT 
AGCCATGAAA 
TCACGAGGCT 
TCAATCATTC 
GACTT CGGTT 
GGGATATTGC 
TTGGAATTGA 
ATTGCCAACT 
acgaaaatat 
cacagttttt 



atcttctaat 
TCTCGCCGTC 
TCATGCAGCT 
GTTTCGAAAG 
TATCTTCTGG 
CCCTGGATAA 
AAGAAAGACA 
CGTTTGGGAG 
GGAAGAATAT 
tgaagataat 
tctta 



tttcagtgaa 
CGCTCCAAGC 
TCATCAT CAA 
GGAATCTAAA 
ACCAACTTGG 
CTTGCAAAAA 
AAATCTTGAA 
CACGCCTACT 
CAGCGAGAGA 
tcagatgtag 



" taaaATGCTG 
ACACTCTCCC 

. AAGCAT CATG 
AGAAGCAATT 
CTAAGGATGG 
CGTCTTTCTG 
GATCGCCACC 
ACTTGCAGTA 
TTTGCCAATG 
tttaaaacgc 



CAATCTACTG 
AGATCTCCCA 
CCACCTACGT 
GCTCTCCAAC 
TGGAGAACCT 
ACATCACTAT 
TGTGCAAACC 
CAAAAATGTC 
CTCGACAATA 
tgagaatttg 



CTCGCACTGC 
TTCGACTATG 
GAACAATCTC 
CAGCGCTGAA 
TCAAAGGAGC 
TGCGGTTCAA 
AGGATCCTTT 
CGCCCAGACT 
Aaagcaggaa 
tatttttgta 



TTCAAAGCTT 
CAGATTTGGA 



GTTCAACCGG 
ACCTGTAATC 
AGGAGAAACT 



ATTCAATGGT 
TGATGGACAC 



TATTAAGCGC 



GGAAGGAATG 
ATGTCCATGC 



attgtttaaa 



GTCCCACTTT 
TATTTGGAAG 

tttcggtttt 
taaaagaacg 
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The gcs-1 gene is located on chromosome n. An exemplary regulatory sequence is: 

■ 

TTATCAACCACTAGGTTCCGTCTTAATCGTCC&AM 

GTTTTTCTGTTTCTATAGTTCTCCATTTTCCATCTCCTCTTCGCTTGTTGAATGGACTTTATTTTGATAAGTTCATTTTA 
ATTTTTCTAACAATCTCATGACTAGCTCATG ATGACAATT GCAAAGAAATTCGTCATATAGAGGGGAAA 

-607 . 

* 

ATATTGAAAAGCCTTCAGGAGAGATGTAGAGACGTAGGAGTAGAGACAGAACATAAATTT<3^ 

AGACAT AGAGTTACCATGGGAAAAACGCTCGC ATT T TCCATT TAACGAGATTTTCTAGATCACAACATTT TGTGATCX2GT 
TGTGCGAAAATCAAGCTTTTTATCAAACTTTTATCGTCTGTTCATTCTTTCTGACAATCTTTATTATCTTATTAAACTTG 
ACTAATTGTATTGAAAGTATTTTTTTAGATGCGAACGAAGTTCCATTTTT CATGACTTAA CATCTCTTAACGTTAGTGAA 

-316 

■ 

* • 

ATTTTTGAATTCCAATTAGGACTACGGTAGGAGTTCTGTAGTTGATTTCCTGAACACTTGTTTTGTAACCTTTCTGAACG 
QATTTTAATATTTCTAAAATTTTAAATTGCAAATCTGAGTCCT 

ATATGA CTTTATG&TG ATGAGATTTAATGTTTCCTTTTGATTTTCTGAATTC 
-121 

ACTGATGTAACTTTCCTTCTAATGTTATCATTTGTATTTTTTTGCAGAATC 
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Exemplary gcs-1 sequences: 




MGLLTKGSPL 
EEQVNAMLGT 
EVAADRKNDD 
IYMDHMGFGM 
PLKNSKWVID 
EHFETIQSSN 
QKDAVLNQKF 
QYLNFISKRA 
AEEHMIVSSQ 



TWAETVPHID 
ANRFLWRPEF 
AANSVFWPEQ 
GCCCIiQVTFQ 
KSRYDSTDCY 
WMNMRFKPPP 
LFRKGLAECK 
TGEINTLAHW 
KRAH 



YIKKHGIAQF 
GSYMIEGTPG 
AVFLGHPRFK 
AVNVDEARWL 
IYPCSVGYND 
PDAPEIGWRV 
SAPENLKGSE 
TRGFVQSHPA 



INLYHRLKSR 
MPYGGLIACF 
NLTKNIKGRR 
YDQLTPITPI 
IPLQYDETIY 
EFRPTEVQLT 
KCGPPSQDIE 
YKHDSDVNDN 



HGDQLKWGDE 
NIVEANMKLR 
GSKVAINVPI 
LLALSAATPI 
KQLIDGNIDE 
DFENAAYCCF 
EMSIDEIXNG 
IVYDLU 



JAM* 



IEYTTVKFDD 
RQWKKLLKK 
FKDTNTPSpF 
FRGKLSNVDS 
PLAKHIAHMF 
WLLTRMMIS 
KKNGFPGLIS 
AISNGEDHCE 



ANKKVRVSCK 
DETCLSISFP 
VEDLSALGGP. 
RWDIISASVD 
IRDPHQVFRE 
FRI/FYLMPIS 
LIRQFLDSAD 
KLLGCYRSKT 



AEELLNKLQA 
SLGVPGFTFP 
DDTRDAKPDH 
DRT PEERGLE 
RIEQDDEKSS 
MVTENMKRAQ 
VDVDTRCTIS 
DHAISAAVRK 





tttgcagaAT 
CACGGAATTG 
TGAATACACT 
TACAAGCCGA 
ATCGAGGGAA 
ACAGGTCGTC 
CATTCCCGGA 
GGCCATCCAC 
CAAGGATACG 
CTGATCACAT 
GATGAAGCCA 
CCGTGGAAAA 
GATTGGAACC 
TCTGTTGGCT 
ACTGGCAAAA 
AAAGCAGTGA 
GAAATGGGAT 
TGTATTGCTC 
GTGCTCAGCA 
AATTTGAAAG 
GAAAAATGGA 
CGATTTCTCA 
GTACAATCTC 
CATCTCAAAC 
TTCGCAAAGC 
aatttattta 
tttagttggc 
aaatttaaac 
tttcatattt 



GGGTCTTTTG 
CTCAATTCAT 
ATTGTAAAAT 
AGAGCAGGTG 
CCCCCGGAAT 
AAAAAGTTAT 
AGTAGCAGCT 
GTTTCAAGAA 
AACACCCCCA 
TTATATGGkT 
GATGGTTGTA 
TTATCCAATG 
TCTCAAGAAT 
ACAATGATAT 
CATATTGCGC 
ACACTTTGAA 
GGAGAGTCGA 
ACCAGAATGA 
AAAAGATGCA 
GATCGGAGAA 
TTCCCAGGTC 
ATATTTGAAC 
ATCCTGCATA 
GGAGAAGATC 
TGAAGAGCAC 

tgttctatac 
catctacatt 
tctgaatttc 
atgtacaatg 



ACGAAAGGTA GTCCGTTGAC 
CAATCTCTAC CATCGTCTGA 
TTGATGACGC AAACAAGAAA 
AATGCGATGC TTGGAACTGC 
GCCTTACGGA GGTCTCATCG 
TAAAGAAGGA TGAAACATGT 
GATAGAAAGA ATGATGATGC 
TCTTACCAAA AATATTAAAG 
GTCCATTCGT TGAAGATTTA 
CATATGGGAT TCGGAATGGG 
CGATCAGCTG ACACCGATTA 
TCGATTCTAG ATGGGATATC 
TCGAAATGGG TTATTGATAA 
TCCTCTTCAA TACGACGAAA 
ATATGTTCAT TCGTGATCCA 
ACAATTCAAT CATCGAATTG 
ATTCCGGCCA ACTGAAGTTC 
TGATCTCCTT CAGGCTGACA 
GTTCTCAATC AGAAATTCCT 
ATGTGGACCA CCTAGTCAAG 
TCATTTCACT TATTCGCCAA 
TTTATTTCGA AACGAGCAAC 
CAAACATGAC AGTGATGTAA 
ACTGTGAGAA GCTGCTCGGA 
ATGATCGTGT CCAGCCAAAA 
gtcgtgtttc ccattccttc 
aaattactga tatgttgatg 
taaatgcttg ttttttgagt 
ctcccctgaa tccatcatat 



GTGGGCAGAA 
AATCAAGACA 
GTTCGCGTGT* 
CAATCGATTC 
CTTGCTTCAA 
CTATCGATAT 
AGCTAATAGC 
GTCGCAGAGG 
TCTGCACTTG 



CACCGATTCT 

ATTAGTGCAA 

GAGTCGCTAC 

CCATATATAA 

CATCAAGTTT 

GATGAACATG 

AACTGACCGA 

TATTTGATGC 

GTTCAGAAAA 

ATATTGAAGA 

TTTCTAGATT 

TGGAGAGATT 

ATGATAATAT 

TGCTACCGCT 

ACGTGCACAT 

taggccttcc 

ctattttcta 

agtaggaatc 



ACCGTACCGC 

CGGAGATCAA 

CGTGCAAAGC 

CTTTGGAGAC 

CATTGTCGAG 

CGTTCCCATC 

GTTTTCTGGC 

AAGTAAAGTA 

GAGGTCCTGA 

CTTCAAGTCA 

ACTGGCACTC 

GTGTCGACGA 

GACTCCACGG 

ACAACTAATT 

TCCGTGAGCG 

CGATT CAAGC 

CTTTGAAAAT 

CAATTTCAAT 

GGATTGGCTG 

AATGTCGATT 

CTGCTGATGT 

AATACTTTGG 

AGTTTACGAT 

CTAAAACCGA 

TAGgcgataa 

atgattcaca 

gtaagcagat 

agtacgaatg 

cgtgttg 



ACATTGATTA 
TTGAAATGGG 
TGAAGAGCTT 
CAGAATTCGG 
GCAAACATGA 
TCTTGGAGTA 
CAGAACAAGC 
GCTATCAACG 
TGATACTCGT 



ACTGTTACAT 

GATGGAAATA 

TATCGAACAG 

CACCACCACC 

GCAGCATACT 

G(5TTACTGAA 

AGTGCAAATC 

GATGAGATTA 

TGATGTGGAT 

CTCACTGGAC 

CTTTTGAAAA 

TCATGCCATT 

ttgattgatt 

atttttcgat 

grtcagtgttt 

gtacattaat 



TATCAAGAAG 
GAGATGAGAT 
CTTAATAAGT 
ATCCTACATG 
AATTGCGCAG 
CCTGGATTCA 
TGTATTCTTG 
TCCCGATATT 
GATGCGAAAC 
TGTGAACGTC 
CACCAATCTT 
GAGGAAAGAG 
TTATCCATGT 
TTGATGAGCC 



AGATGCTCCA 
GTTGCTTCGT 
AATATGAAGC 
TGCTCCCGAA 
TCAATGGAAA 
ACT CGGTGTA 
ACGTGGATTC 
AGATGGATGC 
TCTGCTGCTG 
atgtgatttt 
gccatatcaa 
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T3.9E7.2c (spliced) 

aatcgttctt cttcttattt tctacagctg atgatgtttg atgaaggttt tattttcctt gctttttcca 

ccctgttaat attattttcg atattcccaa aaataattcc aaattttcag tccatattca fcctggatact 

tgcaaeatca tcactgattt tggtgatcag ttcaccatcg tccaacacct caatccaatc atcgtcatac 

gatcggatca cgacaaaaca tcttctggac aatatatcac cgacatttag tgagtatgac ttgaaaagtg 

jcatctgatca cttttcgagc cgttttgtcg ctagggactt tttaatgaat cagatgtact tttcgaattt 

tttagagcaa aagcagtagt tgcacttttg aaacttaaat taatatacaa aactatgata tatattttca 
qaaftJG' 












rl 





gaagc 



cctctt-ttat cacataaaat ctcggtcgaa accttattaa agccacataa ttaaagataa ttaattccgc 

cacaataatc gtttttttpt tctttgccgt gtctcatttc attttgatct actctttcct cccttcggat 

tctttgattt cccagtgaaa tacctcaccc acttcaatcc ccacaaagtg agcaacccct atcttgcaac 

agttttatca tctcttcatc atacccagtt tgataattta ttatctgatc cccatcccct tgtcgcctct 

cattagtatc ctagtttttc atttgagccc. ggagctcaga ctacatctcc gaatcatcat acaaatagat 

agaaacgggt ctcgtgacga aagaatacgt gcaccacacg acccccccat cctgttcacc cccatacacc 

*tgaaaaatat gatctttaca gttatttcta ttatatcctc aaatctctcg taatatcgta ; tcaatttcct 

cttctttttt gteattttca atttttctca aatttctcag atrctattctt ttt^ttgtat ttttggaact 
tgtatccctc ctccatcccc agact 



T19E7.2c (conceptual translation) 
MYTDSNNRNF DEVNHQHQQE QDFNGQSKYD 
PPASPFGRGP STERPTTSSR YEYSSPSLED 
APLTAEENAR YEDLSKGFYN GFFESFNNNQ 
EDVSREEGQL NQLFDNKQQH PV1NNVSLSE 
VTDSQTVEQW LPTEWPNDV FPTSNYAYIG 
QQTQTSPGSV TVTATATQSL FDPYHSQRHS 
PSSGgRYQRS SSPRSSQSSI KIARWPLAS 
NESLSEYQRQ LIRKIRRRGK NKVAARTCRQ 



YPQFNRPMGL 
IDLIDVLWRS 
YQQKHQQQQR 
GIVYNQANLT 
MQNDSLQAW 
FSDCTTDSSS 
GQRKRGRQSK 
RRTDRHDKMS 



RWRDDQRMME 
DIAGEKGTRQ 
EQIKTPTLEH 
EMQEMRDSCN 
SNGQIDYDHS 
TCSRLSSESP 
DEQLASDNEL 
HYI* 



YFMSNGPVET 
VAPADQYECD 
PTQKAELEDD 
QVSISTIPTT 
YQSTGQTPLS 
RYTSESSTGT 
PVSAFQISEM 



VPVMPILTEH 
LQTLTEKSTV 
LFDEDLAQLF 
STAQPETLFN 

PLIIGSSGRQ 
HESRFYGKLA 
SLSELQQVLK 
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T19E7 . 2b (spliced) ^ 

GGCT®5SSCA r ;GCT6TCAACT' CTTTTGftCg^ |^M|Effi^ nWMa * 
" ^$3\TGAGAGA «=~==™™ 






— ^ ^_l°^gc cctcttttat cacataaaat ctcggtcgaa accttattaa 

kgccacataa ttaaagataa ttaattccgc cacaataatc gtttttttct tctttgccgt gtctcatttc 
attttgatct actctttcct cccttcggat tctttgattt cccagtgaaa. tacctcaccc acttcaatcc 
ccacaaagtg agcaacccct atcttgcaac agttttatca tctcttcatc atacccagtt tgataattta 
ttatctgatc cccatcccct tgtcgcctct . cattagtatc ctagtttttc atttgagccc ggagctcaga 
ctacatctcc gaatcatcat acaaatagat agaaacgggt ctcgtgacga aagaatacgt gcaccacacg 
acccccccat cctgttcacc cccataca'cc tgaaaaatat gatctttaca gttatttcta ttatatcctc 
aaatctctcg taatatcgta tcaatttcct cttctttttt gtcattttca atttttctca aatttctcag 
atctattctt tttcttgtat ttttggaact tgtatccctc ctccatcccc agacttcccc ttcccagtta 
ctcttgtaca ttttcatata tgtccatata tcgtttgaat ctctcattta 

* * * 

T19E7.2b (conceptual translation) 

MSLPSDFASS LLASSTTTNT TNTAPAAVNS FDEQEEESKK ILNMYLQMFN QQQVDQHGHH HQHPYAYSGV 
SSTFDRVFPT SNYAYIGMQN DSLQAVVSNG QIDYDHSYQS TGQTPLSPLI IGSSGRQQQT QTSPGSVTVT 
ATATQSLFDP YHSQRHSFSD CTTDSSSTCS RLSSESPRYT SESSTGTHES RFYGKLAPSS GSRYQRSSSP 
RSSQSSIKIA RWPLASGQR KRGRQSKDEQ LASDNELPVS AFQISEMSLS ELQQVLKNES LSEYQRQLIR 
KIRRRGKNKV AARTCRQRRT DRHDKMSHYI * 
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T19E7.2a (spl i ced) • [ 




jgaagccct cttttatcac 

ataaaatctc ggtcgaaacc ttattaaagc cacataatta aagataatta attccgccac aataatcgtt 

tttttcttct ttgccgtgtc tcatttcatt ttgatctact ctttcctccc ttcggattct ttgatttccc 

agtgaaatac ctcacccact tcaatcccca caaagtgagc aacccctatc ttgcaacagt tttatcatct 

cttcatcata cccagtttga taatttatta tctgatcccc atccccttgt cgcctctcat tagtattscta 

gtttttcatt tgagcccgga gctcagacta catctccgaa tcatcataca aatagataga aacgggtctc 

gtgacgaaag aatacgtgca ccacacgacc cccccatcct gttcaccccc atacacctga aaaatatgat 

ctttacagtt atttctatta tatcctcaaa tctctcgtaa tatcgtatca atttcctctt. cttttttgtc 

attttcaatt tttctcaaat ttctcagratc tattcttttt cttgtatttt tggaacttgt atccctcctc 

catccccaga cttccccttc ccagttactc ttgtacattt tcatatatgt ccatatatcg tttgaatctc 

tcatttatgg aaataaattt gaaaaaatc 



T19E7.2a {conceptual translation) 
MGGSSRRQRS TSATRRDDKR RRRQCFSSVA" 
SSYDRITTKH LLDNISPTFK MYTDSNNRNF 
YFMSNGPVET VPVMPILTEH PPASPFGRGP 
VAPADQYECD LQTLTEKSTV APLTAEENAR 
PTQKAELEDD LFDEDLAQLF EDVSREEGQL 
QVSISTIPTT STAQPETLFN VTDSQTVEQW 
YQSTGQTPLS PLIIGSSGRQ QQTQTSPGSV 
RYTSESSTGT HESRFYGKLA PSSGSRYQRS 
PVSAFQISEM SLSELQQVLK NESLSEYQRQ 



DDEEETTSIY 
DEVNHQHQQE 
STERPTTSSR 
YEDLSKGFYN 
NQLFDNKQQH 
LPTEWPNDV 
TVTATATQSL 
SSPRSSQSSI 
LIRKIRRRGK 



GVSSIFIWIL 
QDFNGQSKYD 
YEYSSPSLED 
GFFESFNNNQ 
PVINNVSLSE 
FPTSNYAYIG 
FDPYHSQRHS 
KIARWPIAS 
NKVAARTCRQ 



ATSSLILVIS 
YPQFNRPMGL 
IDLIDVLWRS 
YQQKHQQQQR 
GIVYNQANLT 
MQNDSLQAW 
FSDCTTDSSS 
GQRKRGRQSK 
RRTDRHDKMS 



SPSSNTSIQS 
RWRDDQRMME 
DIAGEKGTRQ 
EQIKTPTLEH 
EMQEMRDSCN 
SNGQIDYDHS 
TCSRLSSESP 
DEQLASDNEL 
HYI* 
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■ 

Glycogen synthase kinase-3 bete (GSK-3 beta) . 

1 msgrprttsf aesckpvqqp safgsmkvsr dkdgskvttv vatpgqgpdr pqevsytdtk 
61 vigngsfgw yqaklcdsge lvaikkvlqd krfknrelqi mrkldhcniv rlryffyssg 
121 ekkdevylnl vldyvpetvy rvarhysrak qtlpviyvkl ymyqlfrsla yihsfgichr 
181 dikpqnllld pdtavlklcd fgsakqlvrg epnvsyicsr yyrapelifg atdytssidv 
241 wsagcvlael llgqpifpgd sgvdqlveii kvlgtptreq iremnpnyte fkfpqikahp 
301 wtkvfrprtp peaialcsrl leytptarlt pleacahsff delrdpnvkl pngrdtpalf 
361 nfttqelssn pplatilipp. hariqaaast ptnataasda ntgdrgqtnn aasasasnst 
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Human Glycogen synthase kinase-3 alpha (GSK-3 alpha) . 

• * 

1 msgggpsggg pggsgrarts sfaepggggg gggggpggsa sgpggtgggk asvgamgggv 
61 gasssgggpg gsggggsggp gagtsfpppg vklgrdsgkv ttwatlgqg persqevayt 
121 dikvigngsf gwyqarlae trelvaikkv Iqdkrfknre lqimrkldhc nivrlryffy 
181 ssgekkdely Inlvleyvpe tvyrvarhft kakltipily vkvymyqlfr slayihsqgv 
241 chrdikpqnl lvdpdtavlk lcdfgsakql vrgepnvsyi csryyrapel ifgatdytss 
301 idvwsagcvl aelllgqpif pgdsgvdqlv eiikvlgtpt reqiremnpn ytef kfpqik 
361 ahpwtkvfks rtppeaialc sslleytpss rlspleacah sffdelrclg tqlpnnrplp 
421 plfnfsagel siqpslnail ipphlrspag tttltpssqa ltetptssdw qstdatptlt 
481 nss 
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Mouse Glycogen synthase kinase-3 beta. 

1 msgrprttsf aesckpvqqp safgsmkvsr dkdgskvttv vatpgqgpdr pqevsytdtk 
61 vigngsfgw yqaklcdsge lvaikkvlqd krfknrelqi mrkldhcniv rlryffyssg 
121 ekkdevylnl vldyvpetvy rvarhysrak qtlpviyvkl ymyqlfrsla yihsfgichr 
181 dilcpqnllld pdtavlklcd fgsakqlvrg epnvsyicsr yyrapelifg atdytssidv 
241 wsagcvlael llgqpifpgd sgvdqlveii. kvlgtptreq iremnpnyte fkfpqikahp 
301 wtkvfrprtp peaialcsrl leytptarlt pleacahsff delrdpnvkl pngrdtpalf 
361 nfttqelssn pplatilipp hariqaaasp panataasdt nagdrgqtnn aasasasnst 
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Mouse Glycogen synthase kinase-3 alpha (GSK-3 alpha) . 

1 masttamdvl eelssdssek qrsvnildsf vkdmferias easflarqar nstinsreiq 

. 61 tairlllpge lcrrgtgcgk asvwamgggv gasssgvggg sggpgstsfl qpgvklghds 

121 rkvttwatv gqdpersqev actdikvign gsf gwyqew ladtrelvai kkvlqdkrf k 

181 yrelqimckl dhcnivrlqy ffyssgekkd dlylnlvley vpetvyxvar hftkakliip 

241 iiyvkvymyq lfrslayihs qgvchrdinl Ivdpdtailk Icdfgsakql vlgttvapel 

301 ytssidvxsa gcvlaellls qpifpgdngv dqlveiikvl gtptreqlre mnpkytefkf 

361 pqikahpwtk vfksrtaprp lhsalacwst hhtqgsphlr lvptaslmnc gvsgpapqrp 

421 ptspcstsvl vicpsnhlsm pfssllt 
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GSK-3 [ Ca enorhahdi ti s elegrajxs] . 

■ 

1 mnkqllscsl ksgkqvtmvy asvatdgvdq qveisyydqk vigrigsfgw flaklsttne 
61 mvaikkvlgd krfknrelqi mrklnhpniv klkyffyssg ekkdelylnl ileyvpetvy 
121 rvarhyskqr qqipmiyvkl ymyqllrsla yihsigichr dikpqnllid pesgvlklcd 
181 fgsakylvm epnvsyicsr yyrapelifg atnytnsidv wsagtvmael llgqpifpgd 
241 sgvdqlveii kvlgtptreq iqsinnpnyke fkfpqikahp wnkvfrvhtp aeaidliski 
301 ieytptsrpt pqaacqhaff delrapdarl psgrplptle mdgpmgtgei sptsgdvagp 
361 sa 
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sgg-1 (GSKt3) inhibits constitutive SKN-1 nuclear 
accumulation and induction of its target gene gcs-1. 



B 



Control [g/o-7(RNAi)] 



yii.'.f U ■ i... . : 




«•» ?3Ki .*bs • 1 r. r s; .■» - 




Control fefo-i(RNAi)] 




sgg-/(RNAi) 



•mm 



1 ^ -« ."CTcMftLi*" 



*gg-i(RNAi) 
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Ala substitution at a predicted GSK-3 phosphorylation 
site results in nuclear localization of SKN-1 



A. 



lTJG 



T68 
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r1\ 
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SKN-1 



393 

TTDSSSTCSRLSSES 



p-Catenin SGIHSGATTTAPSLS 



P P P 

\ \ / 

GSK-3 



P 

t 

CK1 



Peptides 



1. WT 



2. WT+P397 



3. S393A+P397 



393 397 
DCTTDSSSTCSRLSSESPRYTSE 

DCTTDSSSTCSRLSSESPRYTSE * 

DCTTDSSSTCARLSSESPRYTSE 



Assay: 




WT 

WT+P397 
S393A+P397 



+ 
+ 



+ 
+ 
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